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Method and System for Multiplication of Binary Numbers 

Cross Reference to Related Applications 

5 This application claims priority from United States provisional application serial 

number 60/405,241, filed August 22, 2002, entitled "Method and System for 
Multiplication of Binary Numbers", the disclosure of which is incorporated herein by 
reference. 

Technical Field 

10 The present invention relates to an efficient method and system for multiplying 

binary numbers. 

Background Art 

Two N-bit binary numbers A = On-i &n-2 • • ai ao] and B = [bw-i bjvj-2 • • bi b 0 ] are 
commonly multiplied as shown in Fig.l (prior art). Here, the multiplication of two 8-bit 

15 numbers A = 185 dec = [1 0 1 1 1 0 0 1] and B = 237 dec = [1 1 1 0 1 1 0 1] to form a 
product 104 is depicted. 

The product A*B is the sum of single partial products at particular binary 
positions. The single partial products are either number B or zero, dependent on the 
associated bit within number A. For example, the LSB of A, a 0 = 1, and thus the partial 

20 product at binary position 2° is number B = [l 1101 101]. The neighboring bit aj = 0, 
and thus the associated partial product is [0 0 0 0 0 0 0 0]. As used in this description 
and the accompanying claims, the array of partial products 102 formed by this common 
method of multiplication shall be referred to as a "diamond-array". The diamond-array 
102 depicted in Fig. 1 requires that 64 digits be added to form the product. 

25 Fig. 2 (prior art) shows multiplication using a well-known Booth scheme. 

Multiplication using the Booth methodology reduces the number of partial products in a 
diamond-array by a factor of 2. This is accomplished by representing one of the 
multiplicands by numbers whose binary weights differ by at least a factor 4. However, 
the Booth scheme disadvantageously requires that negative numbers be introduced. For 

30 example, number A = 185 dec = [101 1 1001] is commonly regarded as a sum of 
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positive terms: 2 7 +2 5 +2 4 +2 3 +2°. Following the Booth scheme, the number A is instead 
represented as 2 8 -2 6 -2*2 2 +2°. Thus, the negative partial sums have to be represented as 
twos compliment numbers. 

Note that at least every second binary position of A is necessarily zero. However, 
5 the representation of negative numbers by twos compliment requires leading sequences of 
ones, which significantly reduce the benefit of the approximately 50% reduction in the 
number of partial products 201. For N = 8, 60 digits remain to be added using the Booth 
methodology. 

Summary 

10 In accordance with one aspect of the invention, a multiplier for multiplying a first 

binary number a = [a N -i . . . ai ao] and a second binary number b = |>n-i • bi bo] is 
presented. The multiplier includes a first port for receiving a first signal representing the 
binary number a, and a second port for receiving a second signal representing the binary 
number b. A triangle array is operatively coupled to the first signal and the second signal. 

15 An adder adds elements of the triangle array to produce a third signal representing a 
product of the first signal and the second signal. 

In accordance with another aspect of the invention, a processor for multiplying a 
first signal representing a first binary number a = [a N -i - . . ai a 0 ] and a second signal 
representing a second binary number b = [bN-i . • • bi bo] is presented. The processor 

20 includes input means for receiving the first signal and the second signal. The processor 
also includes means for forming a triangle array as a function of the first signal and the 
second signal. An adder adds elements of the triangle array to form a third signal 
representing a product of the first signal and the second signal. 

In accordance with still another aspect of the invention, a method for performing 

25 digital signal processing that requires multiplication of a first signal representing a first 
binary number a = [aN-i ... ai ao] and a second signal representing a second binary 
number b = [b N -i . . . bi b 0 ] is presented. The method includes receiving the first signal 
and the second signal. A triangle array is formed from the first binary number and the 
second binary number, the triangle array including lines k = 0 to N-l. The lines k = 0 to 

30 N-l are added to produce a third signal representing a product of the first signal and the 
second signal. 
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In accordance with yet another aspect of the invention, a computer program 
product for use on a computer system for multiplying a first binary number a = [a N -i . . . aj 
ao] and a second binary number b = [b N -i - . . bi b 0 ] is presented. The computer program 
product includes a computer usable medium having computer readable program code 

5 thereon. The computer readable program code includes program code for forming a 
triangle array from the first binary number and the second binary number, the triangle 
array including lines k = 0 to N-l. The computer readable program code also includes 
program code for adding lines k = 0 to N-l. 

In accordance with embodiments related to the above-described embodiments of 

10 the invention, the triangle array includes lines k = 0 to N-l, such that line k = 0 of the 
triangle array is equal to [0 (ao *b 0 )]; and the lines k = 1 to N-l of the triangle array are 
equal to [pi pO d k _i ... di do]. The peak bits [pi p0] for each line k are determined by [pi 
p0] = [0 0] if a k * b k + 1; [pi pO] = [1 0] if [a k * b k = 1 AND c k = 1]; and [pi pO] = [0 1] if 
[a k *b k = 1 AND c k = 0]. The bits [d k _i ... di do] for each line k are determined by: [d k _i 

15 ... d! do] = [0k-i ...0o] if [a k b k ] = [0 0]; [d k _i ... di do] = [a k _i ... ai a 0 ] if [a k b k ] = [0 
1]; [d k _i ... di do] = [b k -i bi b 0 ] if [a k b k ] = [1 0]; and [d k -i ... di do] = [s k _i ... Si s 0 ] 
if [a k b k ] = [1 1]. The sequence s = [s N -2 ... Si s 0 ] is equal to the sum sequence [(a N -2 + 
bN-2) • • • (ai+bi) (ao+bo)] and c = [cn-i . . . ci] is equal to the carry sequence associated 
with the sum sequence s. The triangle array thus formed may be represented by a number 

20 of digits that is approximately 20-50% less than the number of digits required in a 
diamond array. 

Brief Description of the Drawings 

The foregoing features of the invention will be more readily understood by 
reference to the following detailed description, taken with reference to the accompanying 
25 drawings, in which: 

Fig. 1 depicts prior art multiplication of two N-bit binary numbers A and B; 
Fig. 2 depicts prior art multiplication of the binary numbers A and B employing 
Booth's algorithm; 

Fig. 3 depicts diamond arrays with rotated V's, in accordance with one 
30 embodiment of the invention; 

Fig. 4 is a table that shows possible peak-bit and branch configurations of a V, in 
accordance with one embodiment of the invention; 
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Fig. 5 depicts the addition of two branch numbers without the most significant bit, 
resulting in a sum sequence and a carry sequence, in accordance with one embodiment of 
the invention; 

Fig. 6 depicts lower branch values when multiplying the numbers A and B, in 
accordance with one embodiment of the invention; 

Fig. 7 depicts a resulting multiplication scheme prior to removal of zero bits 
positioned above pairs of peak-bits, in accordance with one embodiment of the invention; 

Fig. 8 depicts a resulting multiplication scheme with zero bits positioned above 
the pairs of peak-bits removed, in accordance with one embodiment of the invention; 

Fig. 9 is a table depicting the switching of bit sequences based on bits ak and bk, in 
accordance with one embodiment of the invention; 

Fig. 10 is a block diagram of a multiplier for multiplying a first binary number 
and a second binary number, in accordance with one embodiment of the invention. 

Description 

15 A method and system for efficiently multiplying binary numbers is presented. In 

particular, the method and system includes reducing the number of digits used in 
connection with partial products formed during multiplication. Details of various 
embodiments are discussed below. 

Fig. 3 shows an advantageous way of looking at the diamond-array 102 of Fig. 1, 

20 in accordance with one embodiment of the invention. Instead of looking at the lines of the 
array, the array can be regarded as composed of structures similar to "rotated V's", whose 
peaks are looking to the left-lower corner. The V's have the following general properties: 

(1) Each V has one peak-bit 301 and two branches 303 and 304 with an equal number 
of bits, respectively. The number of bits within a branch can also be zero. For 

25 example, a V may consists of only the peak-bit. 

(2) The peak-bit 301 of each V is the product of the bits of equal binary position 
within numbers A and B, respectively. In Fig.3, two V's 301 are highlighted. The 
corresponding peak-bits (bold) 301 are the products of the first bits (MSB) 

a 7 l>7 = 1, and bits a 3 b 3 = 0, respectively (see Fig. 4, discussed below). 
30 (3) The branches 303 and 304 include either truncated versions of numbers A or B, or 

zeros only. In Fig.3, the V 301 with peak-bit "1" has an upper branch containing 7 
bits of A (i.e., [a6 . . . ai ao]= [0 1 0 1 0 0 1]), and a lower branch consisting of 7 bits 
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of B (i.e., [b 6 . . . bi b 0 ] = [1 1 0 1 1 0 l]). The V 301 with peak bit "0" shows an 
upper branch 303 composed of 4 zeros (i.e., [0000]) and a lower branch 304 
composed of 4 bits of A (i.e., [b 3 b 2 bi b 0 ] = [110 1]). 

(4) Four possible configurations can occur, determined by the two bits within A and 
5 B, that can be used to determine the peak-bit. Fig. 4 is a table that shows these 

four configurations, in accordance with one embodiment of the invention. 

(5) The overall diamond-array is fully covered by exactly N non-overlapping Vs. 

The V ? s are defined by bits a k and bk at binary position k in numbers A and B, as 
10 summarized in Fig. 4. If both bits a k and b k are zero, both branches contain only zeros. 
For a k = 0 and b k = 1, bit sequence [a k _i . . . ai a 0 ] appears in the upper branch, and for 
a k = 1 and b k = 0, sequence [b k _i . . . bi bo] appears in the lower branch. For a k = 1 and 
b k = 1, both sequences [a k _i . . . ai a 0 ] and [b k _i . . . bi b 0 ] have to be considered. In general, 
bits a k and b k can be regarded as switches, where bits a k activate or deactivate the 
15 truncated versions of number B, and bits b k activate or deactivate the truncated versions 
of number A. 

Using the commutative law, upper and lower branches of the V's can be flipped 
arbitrarily, without changing the overall sum. For example, all zero-branches can be 
flipped such that they become upper branches. This causes a concentration of zeros in 

20 the upper left area of the diamond-array, that is, the region above the line of peak-bit 

elements. Only branches of V's whose peak-bit is "1" may contain non-zero elements in 
this region. To also remove these, the two branches are added, and the sum positioned in 
the lower branches. This addition does not need to be done individually for each V of 
particular length. Instead, it can be done once by adding sequences [a N . 2 . . . ai a 0 ] and 

25 [b N -2 ... bi b 0 ], for example, numbers A and B without the most significant bit, 
respectively. The results are sum-sequence [sn-2 ... s\ So], and the carry-sequence 
[c N -i ... c 2 ci]. For the present example, these sequences are shown in Fig.5. 

The carries at each bit position are of interest, since different V's have different 
lengths of branches. If the overall carry after addition of the two branches is "1", the 

30 binary position of this carry is equal to the binary position of the peak-bit "1". Addition 
of the two "l's" results in "10". This leads to the following simple rule: if a carry appears, 
the peak-bit "1" is shifted one position to the left. In the following, single peak-bits are 
replaced by pairs of peak-bits [pi po]. The resulting "lower branches" 602 are 
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summarized for all branch-lengths in Fig. 6, in accordance with one embodiment of the 
invention. 

Note that the first two bits in each line of lower branches 602 in Fig. 6 represent 
the pair of peak-bits [pi pO] (bold), which are either "01" (no carry) or "10" (with carry). 
5 The remaining bits are bit sequences from the sum in Fig. 5, ranging from lengths 1 to 7. 
After flipping branches of V's with peak-bit "0" as described above, and 
representing V's with peak-bit "1" as lower branches 602 according to Fig.6, the 
multiplication scheme looks like Fig.7, in accordance with one embodiment of the 
invention. Note that the single peak-bits are replaced by a pair of bits (bold). 
10 Now all bits above the pairs of peak-bits are zero and thus can be omitted as 

shown in Fig.8, in accordance with one embodiment of the invention. As used in this 
description and the accompanying claims, the resulting array shall generally be referred 
to as a "triangle-array." 

As compared to the scheme depicted in Fig.l, the diamond-array 102 with exactly 

N(N + 3) 

15 N 2 = 64 digits has changed to the triangle-array 801 with exactly = 44 digits, a 

bit-reduction of approximately 30%. For larger N, a bit-reduction of roughly 50% may be 
achieved. This reduction also compares favorably to the Booth-algorithm, which requires 
60 digits to be added for N = 8, considerably more than the 44 digits required for N = 8 in 
the present invention. 

20 Fig. 10 shows a block diagram of a multiplier 1000 for multiplying a first signal 

and a second signal, in accordance with one embodiment of the invention. The first 
signal represents a first binary number a = [a N -i . . . ai ao], and the second signal represents 
a second binary number b = [b N -i . . . bi bo]- Multiplier may be, without limitation, a 
device such as a communications device, a signal processor, a microprocessor, central 

25 processor, and/or computer for operating on data signals. 

The multiplier includes a first port 1001 for receiving the first signal 1005. A 
second port 1002 receives the second signal 1006. 

Operatively coupled to the first port 1001 and the second port 1002 is a triangle- 
array 1003. The triangle array 1003 may be represented in, without limitation, a tangible 

30 medium, such as a register and/or computer readable medium. For example, the triangle- 
array 1003 may be stored, without limitation, on a diskette, CD-ROM, ROM, RAM, or 
fixed disk. 
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The triangle array 1003 includes lines k = 0 to N-l. Line k = 0 of the triangle 
array is equal to [0 (a 0 * b 0 )]. Line k = 0 equal to [pi p 0 ] = [1 1] cannot occur. 

Lines k = 1 to N-l of the triangle array are equal to [pi p0 d K -i . . . dj do]. The pair 
of peak-bits [pi p 0 ] in each line k = 1 to N-l is different from [0 0], only if a k *b N = L 
5 When a k *b k = 1, [pi p 0 ] = [0 1] for c k = 0, and [pi p 0 ] = [1 0] for c k = 1 . 

Each digit in the triangle-array below the pairs of peak-bits (i.e., [d k .i . . . di do]) 
may be controlled by bits a k and b k according to the table shown in Fig. 9, in accordance 
with one embodiment of the invention. In particular, for each line k, [d k .i ... di do] is set 
to: all zeros, if [a k b k ] = [0 0]; [a k _i ... ai ao] if [a k b k ] = [0 1]; [b k .i ... bj b 0 ] if [a k b k ] 
10 = [1 0]; or [s k _i ... Si s 0 ] if [a k b k ] = [1 1]. Such control may be implemented using a 
multiplexer for each digit, which may be a 4 to 1 multiplexer. The inputs into the 
multiplexer can be [0 0 . . . 0], [a k _i ... ai a 0 ], [b k _i ... bi b 0 ], or [s k _ 2 ... si s 0 ]> which are 
controlled by a k and b k . 

An adder 1004 adds the lines k in the triangle-array 1003 to produce a third signal 
15 1007. The third signal 1007 represents a product of the first signal 1005 and the second 
signal 1006. 

In accordance with various embodiments of the invention, the multiplier may 
include an additional (k-l)-bit adder for addition of numbers A and B without most 
significant bit's to compute the sum sequence [s N -2 • • Si so], and the carry-sequence 
20 [c N -i ... c 2 ci]. See Fig. 5 and accompanying text for details regarding the sum and carry 
sequence. 

Squaring of binary numbers represents a special case of the method and system 
described herein. In a squaring scheme, only symmetrical V's occur, that is, they are 
composed of either 2 branches of zeros, or 2 branches each containing a truncated version 
25 of the number to be squared. The adder to compute the sum-sequence [s N -2 • • • Si s 0 ] and 
the carry-sequence [cn-i . . . c 2 Ci] can be omitted, since the addition of two equal numbers 
can trivially be achieved by shifting the number to the left and adding a tailing zero. 
Additionally, the 4-to-l multiplexers can be replaced by simple AND-gates. 

In various embodiments of the invention, the disclosed system and method for 
30 multiplying binary numbers may be implemented as a computer program product for use 
with a computer system or processor. Such implementation may include a series of 
computer instructions fixed either on a tangible medium, such as a computer readable 
medium (e.g., a diskette, CD-ROM, ROM, or fixed disk) or transmittable to a computer 
system, via a modem or other interface device, such as a communications adaptor 
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connected to a network over a medium. The medium may be either a tangible (e.g., 
optical or analog communications lines) or a medium implemented with wireless 
techniques (e.g., microwave, infrared, or other transmission techniques). The series of 
computer instructions embodies all or part of the functionality previously described 

5 herein with respect to the system and method. Those skilled in the art should appreciate 
that such computer instructions can be written in a number of programming languages for 
use with may computer architectures or operating systems. Further, such instructions may 
be stored in any memory device, such as a semiconductor, magnetic, optical or other 
memory devices, and may be transmitted using any communications technology, such as 

10 optical, infrared, microwave, or other transmission technologies. It is expected that such a 
computer program product may be distributed as a removable medium with 
accompanying printed or electronic documentation (e.g., shrink wrapped software), 
pre-loaded with a computer system (e.g., on system ROM or fixed disk), or distributed 
from a server or electronic bulletin board over a network (e.g., the Internet or World 

15 Wide Web). Of course, some embodiments of the invention may be implemented as a 
combination of both software (e.g., a computer program product) and hardware. Still 
other embodiments of the invention are implemented as entirely hardware, as discussed 
previously, or entirely software (e.g., a computer program product). 

The present invention may be embodied in still other specific forms without 

20 departing from the true scope of the invention. The described embodiments are to be 
considered in all respects only as illustrative and not restrictive. 
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